Dataset statistics
| Number of variables | 23 |
|---|---|
| Number of observations | 641914 |
| Missing cells | 8758 |
| Missing cells (%) | 0.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 482.2 MiB |
| Average record size in memory | 787.7 B |
Variable types
| CAT | 11 |
|---|---|
| NUM | 9 |
| BOOL | 3 |
Reproduction
| Analysis started | 2020-05-29 21:04:26.547806 |
|---|---|
| Analysis finished | 2020-05-29 21:07:48.922473 |
| Duration | 3 minutes and 22.37 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
transactionDateTime has a high cardinality: 635472 distinct values | High cardinality |
merchantName has a high cardinality: 2493 distinct values | High cardinality |
currentExpDate has a high cardinality: 165 distinct values | High cardinality |
accountOpenDate has a high cardinality: 1826 distinct values | High cardinality |
dateOfLastAddressChange has a high cardinality: 2186 distinct values | High cardinality |
customerId is highly correlated with accountNumber | High correlation |
accountNumber is highly correlated with customerId | High correlation |
enteredCVV is highly correlated with cardCVV | High correlation |
cardCVV is highly correlated with enteredCVV | High correlation |
merchantCountryCode is highly correlated with acqCountry | High correlation |
acqCountry is highly correlated with merchantCountryCode | High correlation |
transactionDateTime is uniformly distributed | Uniform |
transactionAmount has 18479 (2.9%) zeros | Zeros |
cardLast4Digits has 6727 (1.0%) zeros | Zeros |
currentBalance has 33622 (5.2%) zeros | Zeros |
| Distinct count | 5000 |
|---|---|
| Unique (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 554770145.8938737 |
|---|---|
| Minimum | 100547107 |
| Maximum | 999985343 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.9 MiB |
Quantile statistics
| Minimum | 100547107 |
|---|---|
| 5-th percentile | 162363014 |
| Q1 | 322319158 |
| median | 543887911 |
| Q3 | 786227686 |
| 95-th percentile | 947027654 |
| Maximum | 999985343 |
| Range | 899438236 |
| Interquartile range (IQR) | 463908528 |
Descriptive statistics
| Standard deviation | 254688449 |
|---|---|
| Coefficient of variation (CV) | 0.4590882385 |
| Kurtosis | -1.25930888 |
| Mean | 554770145.9 |
| Median Absolute Deviation (MAD) | 226867261 |
| Skewness | 0.02824548133 |
| Sum | 3.561147234e+14 |
| Variance | 6.486620607e+16 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 318001076 | 10034 | 1.6% | |
| 456044564 | 8382 | 1.3% | |
| 812328116 | 5494 | 0.9% | |
| 838085703 | 5129 | 0.8% | |
| 239875038 | 4705 | 0.7% | |
| 877017103 | 4435 | 0.7% | |
| 278064853 | 4227 | 0.7% | |
| 353215513 | 3756 | 0.6% | |
| 314506271 | 3410 | 0.5% | |
| 917216469 | 3258 | 0.5% | |
| 822203001 | 3046 | 0.5% | |
| 412558887 | 3044 | 0.5% | |
| 901922840 | 3021 | 0.5% | |
| 428892294 | 2983 | 0.5% | |
| 235721673 | 2687 | 0.4% | |
| 772212779 | 2613 | 0.4% | |
| 990764813 | 2594 | 0.4% | |
| 226896970 | 2387 | 0.4% | |
| 520717889 | 2324 | 0.4% | |
| 289059209 | 2275 | 0.4% | |
| 792317293 | 2205 | 0.3% | |
| 832350956 | 2084 | 0.3% | |
| 377838194 | 1978 | 0.3% | |
| 484705396 | 1920 | 0.3% | |
| 719873381 | 1917 | 0.3% | |
| Other values (4975) | 552006 | 86.0% |
| Value | Count | Frequency (%) | |
| 100547107 | 85 | < 0.1% | |
| 100634414 | 24 | < 0.1% | |
| 100973869 | 46 | < 0.1% | |
| 101192712 | 20 | < 0.1% | |
| 101548993 | 29 | < 0.1% | |
| 101660233 | 46 | < 0.1% | |
| 101680180 | 41 | < 0.1% | |
| 101754476 | 31 | < 0.1% | |
| 101970909 | 27 | < 0.1% | |
| 102085969 | 27 | < 0.1% |
| Value | Count | Frequency (%) | |
| 999985343 | 104 | < 0.1% | |
| 999984515 | 32 | < 0.1% | |
| 999789077 | 72 | < 0.1% | |
| 999275549 | 230 | < 0.1% | |
| 999273501 | 8 | < 0.1% | |
| 999246377 | 43 | < 0.1% | |
| 999116200 | 295 | < 0.1% | |
| 998837644 | 32 | < 0.1% | |
| 998480579 | 27 | < 0.1% | |
| 998034300 | 31 | < 0.1% |
| Distinct count | 5000 |
|---|---|
| Unique (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 554770145.8938737 |
|---|---|
| Minimum | 100547107 |
| Maximum | 999985343 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.9 MiB |
Quantile statistics
| Minimum | 100547107 |
|---|---|
| 5-th percentile | 162363014 |
| Q1 | 322319158 |
| median | 543887911 |
| Q3 | 786227686 |
| 95-th percentile | 947027654 |
| Maximum | 999985343 |
| Range | 899438236 |
| Interquartile range (IQR) | 463908528 |
Descriptive statistics
| Standard deviation | 254688449 |
|---|---|
| Coefficient of variation (CV) | 0.4590882385 |
| Kurtosis | -1.25930888 |
| Mean | 554770145.9 |
| Median Absolute Deviation (MAD) | 226867261 |
| Skewness | 0.02824548133 |
| Sum | 3.561147234e+14 |
| Variance | 6.486620607e+16 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 318001076 | 10034 | 1.6% | |
| 456044564 | 8382 | 1.3% | |
| 812328116 | 5494 | 0.9% | |
| 838085703 | 5129 | 0.8% | |
| 239875038 | 4705 | 0.7% | |
| 877017103 | 4435 | 0.7% | |
| 278064853 | 4227 | 0.7% | |
| 353215513 | 3756 | 0.6% | |
| 314506271 | 3410 | 0.5% | |
| 917216469 | 3258 | 0.5% | |
| 822203001 | 3046 | 0.5% | |
| 412558887 | 3044 | 0.5% | |
| 901922840 | 3021 | 0.5% | |
| 428892294 | 2983 | 0.5% | |
| 235721673 | 2687 | 0.4% | |
| 772212779 | 2613 | 0.4% | |
| 990764813 | 2594 | 0.4% | |
| 226896970 | 2387 | 0.4% | |
| 520717889 | 2324 | 0.4% | |
| 289059209 | 2275 | 0.4% | |
| 792317293 | 2205 | 0.3% | |
| 832350956 | 2084 | 0.3% | |
| 377838194 | 1978 | 0.3% | |
| 484705396 | 1920 | 0.3% | |
| 719873381 | 1917 | 0.3% | |
| Other values (4975) | 552006 | 86.0% |
| Value | Count | Frequency (%) | |
| 100547107 | 85 | < 0.1% | |
| 100634414 | 24 | < 0.1% | |
| 100973869 | 46 | < 0.1% | |
| 101192712 | 20 | < 0.1% | |
| 101548993 | 29 | < 0.1% | |
| 101660233 | 46 | < 0.1% | |
| 101680180 | 41 | < 0.1% | |
| 101754476 | 31 | < 0.1% | |
| 101970909 | 27 | < 0.1% | |
| 102085969 | 27 | < 0.1% |
| Value | Count | Frequency (%) | |
| 999985343 | 104 | < 0.1% | |
| 999984515 | 32 | < 0.1% | |
| 999789077 | 72 | < 0.1% | |
| 999275549 | 230 | < 0.1% | |
| 999273501 | 8 | < 0.1% | |
| 999246377 | 43 | < 0.1% | |
| 999116200 | 295 | < 0.1% | |
| 998837644 | 32 | < 0.1% | |
| 998480579 | 27 | < 0.1% | |
| 998034300 | 31 | < 0.1% |
creditLimit
Real number (ℝ≥0)
| Distinct count | 10 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10697.210607651492 |
|---|---|
| Minimum | 250 |
| Maximum | 50000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.9 MiB |
Quantile statistics
| Minimum | 250 |
|---|---|
| 5-th percentile | 500 |
| Q1 | 5000 |
| median | 7500 |
| Q3 | 15000 |
| 95-th percentile | 50000 |
| Maximum | 50000 |
| Range | 49750 |
| Interquartile range (IQR) | 10000 |
Descriptive statistics
| Standard deviation | 11460.35913 |
|---|---|
| Coefficient of variation (CV) | 1.07134089 |
| Kurtosis | 5.364802458 |
| Mean | 10697.21061 |
| Median Absolute Deviation (MAD) | 5000 |
| Skewness | 2.294548813 |
| Sum | 6866689250 |
| Variance | 131339831.5 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 5000 | 127001 | 19.8% | |
| 7500 | 105340 | 16.4% | |
| 15000 | 91936 | 14.3% | |
| 10000 | 67477 | 10.5% | |
| 20000 | 64307 | 10.0% | |
| 2500 | 59421 | 9.3% | |
| 50000 | 38039 | 5.9% | |
| 500 | 32751 | 5.1% | |
| 1000 | 27861 | 4.3% | |
| 250 | 27781 | 4.3% |
| Value | Count | Frequency (%) | |
| 250 | 27781 | 4.3% | |
| 500 | 32751 | 5.1% | |
| 1000 | 27861 | 4.3% | |
| 2500 | 59421 | 9.3% | |
| 5000 | 127001 | 19.8% | |
| 7500 | 105340 | 16.4% | |
| 10000 | 67477 | 10.5% | |
| 15000 | 91936 | 14.3% | |
| 20000 | 64307 | 10.0% | |
| 50000 | 38039 | 5.9% |
| Value | Count | Frequency (%) | |
| 50000 | 38039 | 5.9% | |
| 20000 | 64307 | 10.0% | |
| 15000 | 91936 | 14.3% | |
| 10000 | 67477 | 10.5% | |
| 7500 | 105340 | 16.4% | |
| 5000 | 127001 | 19.8% | |
| 2500 | 59421 | 9.3% | |
| 1000 | 27861 | 4.3% | |
| 500 | 32751 | 5.1% | |
| 250 | 27781 | 4.3% |
availableMoney
Real number (ℝ)
| Distinct count | 450690 |
|---|---|
| Unique (%) | 70.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6652.828572659265 |
|---|---|
| Minimum | -1244.93 |
| Maximum | 50000.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.9 MiB |
Quantile statistics
| Minimum | -1244.93 |
|---|---|
| 5-th percentile | 164.68 |
| Q1 | 1114.97 |
| median | 3578.165 |
| Q3 | 8169.185 |
| 95-th percentile | 19992.9255 |
| Maximum | 50000 |
| Range | 51244.93 |
| Interquartile range (IQR) | 7054.215 |
Descriptive statistics
| Standard deviation | 9227.132275 |
|---|---|
| Coefficient of variation (CV) | 1.38694875 |
| Kurtosis | 9.474952535 |
| Mean | 6652.828573 |
| Median Absolute Deviation (MAD) | 2912.235 |
| Skewness | 2.888834825 |
| Sum | 4270543800 |
| Variance | 85139970.03 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 5000 | 5236 | 0.8% | |
| 250 | 5119 | 0.8% | |
| 7500 | 4309 | 0.7% | |
| 15000 | 4239 | 0.7% | |
| 500 | 3391 | 0.5% | |
| 10000 | 2861 | 0.4% | |
| 2500 | 2719 | 0.4% | |
| 20000 | 2671 | 0.4% | |
| 1000 | 1761 | 0.3% | |
| 50000 | 1317 | 0.2% | |
| 214.29 | 15 | < 0.1% | |
| 4993.97 | 15 | < 0.1% | |
| 228.63 | 15 | < 0.1% | |
| 7459.52 | 15 | < 0.1% | |
| 460.59 | 15 | < 0.1% | |
| 4954.47 | 13 | < 0.1% | |
| 244.77 | 13 | < 0.1% | |
| 4908.94 | 12 | < 0.1% | |
| 19940.89 | 12 | < 0.1% | |
| 4863.41 | 12 | < 0.1% | |
| 2446.14 | 12 | < 0.1% | |
| 14946.48 | 11 | < 0.1% | |
| 7470.28 | 11 | < 0.1% | |
| 212.44 | 11 | < 0.1% | |
| 174.88 | 11 | < 0.1% | |
| Other values (450665) | 608098 | 94.7% |
| Value | Count | Frequency (%) | |
| -1244.93 | 1 | < 0.1% | |
| -1112.12 | 1 | < 0.1% | |
| -1027.96 | 1 | < 0.1% | |
| -973.67 | 1 | < 0.1% | |
| -904.64 | 1 | < 0.1% | |
| -894.64 | 1 | < 0.1% | |
| -875.73 | 1 | < 0.1% | |
| -856.54 | 1 | < 0.1% | |
| -855.97 | 1 | < 0.1% | |
| -815.27 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 50000 | 1317 | 0.2% | |
| 49999.94 | 1 | < 0.1% | |
| 49999.49 | 1 | < 0.1% | |
| 49999.43 | 2 | < 0.1% | |
| 49999.36 | 1 | < 0.1% | |
| 49999.26 | 1 | < 0.1% | |
| 49999.2 | 1 | < 0.1% | |
| 49998.72 | 1 | < 0.1% | |
| 49998.63 | 1 | < 0.1% | |
| 49998.55 | 1 | < 0.1% |
| Distinct count | 635472 |
|---|---|
| Unique (%) | 99.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.9 MiB |
| 2016-12-30T09:23:08 | 3 |
|---|---|
| 2016-07-19T13:03:15 | 3 |
| 2016-08-07T03:58:57 | 3 |
| 2016-06-13T06:30:14 | 3 |
| 2016-11-27T14:24:54 | 3 |
| Other values (635467) |
| Value | Count | Frequency (%) | |
| 2016-12-30T09:23:08 | 3 | < 0.1% | |
| 2016-07-19T13:03:15 | 3 | < 0.1% | |
| 2016-08-07T03:58:57 | 3 | < 0.1% | |
| 2016-06-13T06:30:14 | 3 | < 0.1% | |
| 2016-11-27T14:24:54 | 3 | < 0.1% | |
| 2016-07-16T16:57:41 | 3 | < 0.1% | |
| 2016-04-04T20:22:12 | 3 | < 0.1% | |
| 2016-06-15T22:46:39 | 3 | < 0.1% | |
| 2016-03-16T19:13:52 | 3 | < 0.1% | |
| 2016-01-19T04:26:56 | 3 | < 0.1% | |
| 2016-04-13T19:43:57 | 3 | < 0.1% | |
| 2016-10-03T17:30:49 | 3 | < 0.1% | |
| 2016-10-18T18:39:13 | 3 | < 0.1% | |
| 2016-01-25T11:27:23 | 3 | < 0.1% | |
| 2016-04-28T08:16:31 | 3 | < 0.1% | |
| 2016-07-21T12:04:45 | 3 | < 0.1% | |
| 2016-10-20T13:20:12 | 3 | < 0.1% | |
| 2016-03-30T13:11:16 | 3 | < 0.1% | |
| 2016-11-27T12:15:24 | 3 | < 0.1% | |
| 2016-12-08T08:33:30 | 3 | < 0.1% | |
| 2016-08-07T22:06:11 | 3 | < 0.1% | |
| 2016-06-14T22:02:40 | 3 | < 0.1% | |
| 2016-01-07T09:56:55 | 3 | < 0.1% | |
| 2016-12-25T20:18:11 | 3 | < 0.1% | |
| 2016-12-26T08:59:23 | 3 | < 0.1% | |
| Other values (635447) | 641839 | > 99.9% |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 0 | 2111188 | 17.3% | |
| 1 | 1893675 | 15.5% | |
| 2 | 1550066 | 12.7% | |
| - | 1283828 | 10.5% | |
| : | 1283828 | 10.5% | |
| 6 | 940073 | 7.7% | |
| T | 641914 | 5.3% | |
| 3 | 568369 | 4.7% | |
| 5 | 513084 | 4.2% | |
| 4 | 510408 | 4.2% | |
| 8 | 300199 | 2.5% | |
| 7 | 300121 | 2.5% | |
| 9 | 299613 | 2.5% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 8986796 | 73.7% | |
| Dash Punctuation | 1283828 | 10.5% | |
| Other Punctuation | 1283828 | 10.5% | |
| Uppercase Letter | 641914 | 5.3% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 2111188 | 23.5% | |
| 1 | 1893675 | 21.1% | |
| 2 | 1550066 | 17.2% | |
| 6 | 940073 | 10.5% | |
| 3 | 568369 | 6.3% | |
| 5 | 513084 | 5.7% | |
| 4 | 510408 | 5.7% | |
| 8 | 300199 | 3.3% | |
| 7 | 300121 | 3.3% | |
| 9 | 299613 | 3.3% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 1283828 | 100.0% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| T | 641914 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| : | 1283828 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 11554452 | 94.7% | |
| Latin | 641914 | 5.3% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 0 | 2111188 | 18.3% | |
| 1 | 1893675 | 16.4% | |
| 2 | 1550066 | 13.4% | |
| - | 1283828 | 11.1% | |
| : | 1283828 | 11.1% | |
| 6 | 940073 | 8.1% | |
| 3 | 568369 | 4.9% | |
| 5 | 513084 | 4.4% | |
| 4 | 510408 | 4.4% | |
| 8 | 300199 | 2.6% | |
| 7 | 300121 | 2.6% | |
| 9 | 299613 | 2.6% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| T | 641914 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 12196366 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 0 | 2111188 | 17.3% | |
| 1 | 1893675 | 15.5% | |
| 2 | 1550066 | 12.7% | |
| - | 1283828 | 10.5% | |
| : | 1283828 | 10.5% | |
| 6 | 940073 | 7.7% | |
| T | 641914 | 5.3% | |
| 3 | 568369 | 4.7% | |
| 5 | 513084 | 4.2% | |
| 4 | 510408 | 4.2% | |
| 8 | 300199 | 2.5% | |
| 7 | 300121 | 2.5% | |
| 9 | 299613 | 2.5% |
| Distinct count | 62735 |
|---|---|
| Unique (%) | 9.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 135.16249698557746 |
|---|---|
| Minimum | 0.0 |
| Maximum | 1825.25 |
| Zeros | 18479 |
| Zeros (%) | 2.9% |
| Memory size | 4.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3.41 |
| Q1 | 32.32 |
| median | 85.8 |
| Q3 | 189.03 |
| 95-th percentile | 430.77 |
| Maximum | 1825.25 |
| Range | 1825.25 |
| Interquartile range (IQR) | 156.71 |
Descriptive statistics
| Standard deviation | 147.0533021 |
|---|---|
| Coefficient of variation (CV) | 1.087974145 |
| Kurtosis | 6.367300632 |
| Mean | 135.162497 |
| Median Absolute Deviation (MAD) | 65.45 |
| Skewness | 2.095715154 |
| Sum | 86762699.09 |
| Variance | 21624.67364 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 18479 | 2.9% | |
| 3.42 | 120 | < 0.1% | |
| 4.94 | 119 | < 0.1% | |
| 8.78 | 118 | < 0.1% | |
| 3.56 | 113 | < 0.1% | |
| 7.91 | 112 | < 0.1% | |
| 33.79 | 112 | < 0.1% | |
| 8.49 | 110 | < 0.1% | |
| 5.1 | 108 | < 0.1% | |
| 53.86 | 104 | < 0.1% | |
| 3.47 | 102 | < 0.1% | |
| 42.3 | 101 | < 0.1% | |
| 5.84 | 101 | < 0.1% | |
| 4.55 | 100 | < 0.1% | |
| 8.57 | 99 | < 0.1% | |
| 6.44 | 98 | < 0.1% | |
| 7.68 | 98 | < 0.1% | |
| 5.33 | 97 | < 0.1% | |
| 8.66 | 96 | < 0.1% | |
| 4.25 | 96 | < 0.1% | |
| 39.37 | 96 | < 0.1% | |
| 8.21 | 96 | < 0.1% | |
| 7.87 | 95 | < 0.1% | |
| 54.2 | 95 | < 0.1% | |
| 45.53 | 95 | < 0.1% | |
| Other values (62710) | 620954 | 96.7% |
| Value | Count | Frequency (%) | |
| 0 | 18479 | 2.9% | |
| 0.01 | 35 | < 0.1% | |
| 0.02 | 44 | < 0.1% | |
| 0.03 | 39 | < 0.1% | |
| 0.04 | 36 | < 0.1% | |
| 0.05 | 32 | < 0.1% | |
| 0.06 | 45 | < 0.1% | |
| 0.07 | 41 | < 0.1% | |
| 0.08 | 36 | < 0.1% | |
| 0.09 | 41 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1825.25 | 1 | < 0.1% | |
| 1760.36 | 1 | < 0.1% | |
| 1743.51 | 1 | < 0.1% | |
| 1692.93 | 1 | < 0.1% | |
| 1687.48 | 1 | < 0.1% | |
| 1655.07 | 1 | < 0.1% | |
| 1633.89 | 1 | < 0.1% | |
| 1598.94 | 1 | < 0.1% | |
| 1574.73 | 1 | < 0.1% | |
| 1571.81 | 1 | < 0.1% |
| Distinct count | 2493 |
|---|---|
| Unique (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.9 MiB |
| Lyft | 25311 |
|---|---|
| Uber | 25263 |
| gap.com | 13824 |
| apple.com | 13607 |
| target.com | 13601 |
| Other values (2488) |
| Value | Count | Frequency (%) | |
| Lyft | 25311 | 3.9% | |
| Uber | 25263 | 3.9% | |
| gap.com | 13824 | 2.2% | |
| apple.com | 13607 | 2.1% | |
| target.com | 13601 | 2.1% | |
| alibaba.com | 13583 | 2.1% | |
| staples.com | 13512 | 2.1% | |
| amazon.com | 13477 | 2.1% | |
| ebay.com | 13472 | 2.1% | |
| discount.com | 13394 | 2.1% | |
| oldnavy.com | 13381 | 2.1% | |
| walmart.com | 13282 | 2.1% | |
| sears.com | 13279 | 2.1% | |
| cheapfast.com | 13057 | 2.0% | |
| Apple iTunes | 7579 | 1.2% | |
| Play Store | 7035 | 1.1% | |
| Mobile eCards | 4169 | 0.6% | |
| Blue Mountain eCards | 4165 | 0.6% | |
| Blue Mountain Online Services | 4149 | 0.6% | |
| Fresh eCards | 4147 | 0.6% | |
| Next Day Online Services | 4135 | 0.6% | |
| Fresh Flowers | 4112 | 0.6% | |
| Next Day eCards | 4084 | 0.6% | |
| Fresh Online Services | 4084 | 0.6% | |
| AMC #606218 | 3371 | 0.5% | |
| Other values (2468) | 378841 | 59.0% |
Length
| Max length | 30 |
|---|---|
| Median length | 13 |
| Mean length | 13.8828893 |
| Min length | 4 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 675677 | 7.6% | ||
| e | 530920 | 6.0% | |
| a | 514000 | 5.8% | |
| o | 424806 | 4.8% | |
| t | 418174 | 4.7% | |
| s | 417630 | 4.7% | |
| n | 351359 | 3.9% | |
| # | 287613 | 3.2% | |
| i | 279424 | 3.1% | |
| c | 264733 | 3.0% | |
| r | 256192 | 2.9% | |
| m | 239687 | 2.7% | |
| l | 229032 | 2.6% | |
| u | 209178 | 2.3% | |
| 1 | 192937 | 2.2% | |
| 4 | 186127 | 2.1% | |
| 2 | 185984 | 2.1% | |
| 3 | 176825 | 2.0% | |
| 9 | 174391 | 2.0% | |
| . | 173569 | 1.9% | |
| 6 | 173525 | 1.9% | |
| 5 | 167214 | 1.9% | |
| 8 | 166925 | 1.9% | |
| 7 | 149724 | 1.7% | |
| y | 139156 | 1.6% | |
| Other values (39) | 1926819 | 21.6% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 5014435 | 56.3% | |
| Decimal Number | 1710194 | 19.2% | |
| Uppercase Letter | 999938 | 11.2% | |
| Space Separator | 675677 | 7.6% | |
| Other Punctuation | 500040 | 5.6% | |
| Dash Punctuation | 11337 | 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| S | 100547 | 10.1% | |
| P | 95370 | 9.5% | |
| C | 87483 | 8.7% | |
| A | 75059 | 7.5% | |
| M | 69720 | 7.0% | |
| B | 69194 | 6.9% | |
| D | 57062 | 5.7% | |
| F | 50388 | 5.0% | |
| U | 41324 | 4.1% | |
| R | 36966 | 3.7% | |
| H | 35145 | 3.5% | |
| E | 34870 | 3.5% | |
| G | 29765 | 3.0% | |
| W | 28030 | 2.8% | |
| L | 27285 | 2.7% | |
| N | 26777 | 2.7% | |
| Z | 26595 | 2.7% | |
| T | 23443 | 2.3% | |
| O | 21373 | 2.1% | |
| K | 20439 | 2.0% | |
| Q | 17157 | 1.7% | |
| I | 15484 | 1.5% | |
| Y | 4302 | 0.4% | |
| V | 3507 | 0.4% | |
| J | 2653 | 0.3% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 530920 | 10.6% | |
| a | 514000 | 10.3% | |
| o | 424806 | 8.5% | |
| t | 418174 | 8.3% | |
| s | 417630 | 8.3% | |
| n | 351359 | 7.0% | |
| i | 279424 | 5.6% | |
| c | 264733 | 5.3% | |
| r | 256192 | 5.1% | |
| m | 239687 | 4.8% | |
| l | 229032 | 4.6% | |
| u | 209178 | 4.2% | |
| y | 139156 | 2.8% | |
| b | 107026 | 2.1% | |
| p | 106202 | 2.1% | |
| h | 97778 | 1.9% | |
| d | 95072 | 1.9% | |
| g | 73696 | 1.5% | |
| w | 65525 | 1.3% | |
| f | 52139 | 1.0% | |
| v | 45074 | 0.9% | |
| z | 42039 | 0.8% | |
| k | 40360 | 0.8% | |
| x | 15233 | 0.3% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 675677 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| # | 287613 | 57.5% | |
| . | 173569 | 34.7% | |
| ' | 38858 | 7.8% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 192937 | 11.3% | |
| 4 | 186127 | 10.9% | |
| 2 | 185984 | 10.9% | |
| 3 | 176825 | 10.3% | |
| 9 | 174391 | 10.2% | |
| 6 | 173525 | 10.1% | |
| 5 | 167214 | 9.8% | |
| 8 | 166925 | 9.8% | |
| 7 | 149724 | 8.8% | |
| 0 | 136542 | 8.0% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 11337 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 6014373 | 67.5% | |
| Common | 2897248 | 32.5% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 530920 | 8.8% | |
| a | 514000 | 8.5% | |
| o | 424806 | 7.1% | |
| t | 418174 | 7.0% | |
| s | 417630 | 6.9% | |
| n | 351359 | 5.8% | |
| i | 279424 | 4.6% | |
| c | 264733 | 4.4% | |
| r | 256192 | 4.3% | |
| m | 239687 | 4.0% | |
| l | 229032 | 3.8% | |
| u | 209178 | 3.5% | |
| y | 139156 | 2.3% | |
| b | 107026 | 1.8% | |
| p | 106202 | 1.8% | |
| S | 100547 | 1.7% | |
| h | 97778 | 1.6% | |
| P | 95370 | 1.6% | |
| d | 95072 | 1.6% | |
| C | 87483 | 1.5% | |
| A | 75059 | 1.2% | |
| g | 73696 | 1.2% | |
| M | 69720 | 1.2% | |
| B | 69194 | 1.2% | |
| w | 65525 | 1.1% | |
| Other values (24) | 697410 | 11.6% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 675677 | 23.3% | ||
| # | 287613 | 9.9% | |
| 1 | 192937 | 6.7% | |
| 4 | 186127 | 6.4% | |
| 2 | 185984 | 6.4% | |
| 3 | 176825 | 6.1% | |
| 9 | 174391 | 6.0% | |
| . | 173569 | 6.0% | |
| 6 | 173525 | 6.0% | |
| 5 | 167214 | 5.8% | |
| 8 | 166925 | 5.8% | |
| 7 | 149724 | 5.2% | |
| 0 | 136542 | 4.7% | |
| ' | 38858 | 1.3% | |
| - | 11337 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 8911621 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 675677 | 7.6% | ||
| e | 530920 | 6.0% | |
| a | 514000 | 5.8% | |
| o | 424806 | 4.8% | |
| t | 418174 | 4.7% | |
| s | 417630 | 4.7% | |
| n | 351359 | 3.9% | |
| # | 287613 | 3.2% | |
| i | 279424 | 3.1% | |
| c | 264733 | 3.0% | |
| r | 256192 | 2.9% | |
| m | 239687 | 2.7% | |
| l | 229032 | 2.6% | |
| u | 209178 | 2.3% | |
| 1 | 192937 | 2.2% | |
| 4 | 186127 | 2.1% | |
| 2 | 185984 | 2.1% | |
| 3 | 176825 | 2.0% | |
| 9 | 174391 | 2.0% | |
| . | 173569 | 1.9% | |
| 6 | 173525 | 1.9% | |
| 5 | 167214 | 1.9% | |
| 8 | 166925 | 1.9% | |
| 7 | 149724 | 1.7% | |
| y | 139156 | 1.6% | |
| Other values (39) | 1926819 | 21.6% |
| Distinct count | 4 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 3913 |
| Missing (%) | 0.6% |
| Memory size | 4.9 MiB |
| US | |
|---|---|
| MEX | 2626 |
| CAN | 1870 |
| PR | 1202 |
| Value | Count | Frequency (%) | |
| US | 632303 | 98.5% | |
| MEX | 2626 | 0.4% | |
| CAN | 1870 | 0.3% | |
| PR | 1202 | 0.2% | |
| (Missing) | 3913 | 0.6% |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 2.013099886 |
| Min length | 2 |
Most occurring characters
| Value | Count | Frequency (%) | |
| U | 632303 | 48.9% | |
| S | 632303 | 48.9% | |
| n | 7826 | 0.6% | |
| a | 3913 | 0.3% | |
| M | 2626 | 0.2% | |
| E | 2626 | 0.2% | |
| X | 2626 | 0.2% | |
| C | 1870 | 0.1% | |
| A | 1870 | 0.1% | |
| N | 1870 | 0.1% | |
| P | 1202 | 0.1% | |
| R | 1202 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Uppercase Letter | 1280498 | 99.1% | |
| Lowercase Letter | 11739 | 0.9% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| U | 632303 | 49.4% | |
| S | 632303 | 49.4% | |
| M | 2626 | 0.2% | |
| E | 2626 | 0.2% | |
| X | 2626 | 0.2% | |
| C | 1870 | 0.1% | |
| A | 1870 | 0.1% | |
| N | 1870 | 0.1% | |
| P | 1202 | 0.1% | |
| R | 1202 | 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 7826 | 66.7% | |
| a | 3913 | 33.3% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 1292237 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| U | 632303 | 48.9% | |
| S | 632303 | 48.9% | |
| n | 7826 | 0.6% | |
| a | 3913 | 0.3% | |
| M | 2626 | 0.2% | |
| E | 2626 | 0.2% | |
| X | 2626 | 0.2% | |
| C | 1870 | 0.1% | |
| A | 1870 | 0.1% | |
| N | 1870 | 0.1% | |
| P | 1202 | 0.1% | |
| R | 1202 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1292237 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| U | 632303 | 48.9% | |
| S | 632303 | 48.9% | |
| n | 7826 | 0.6% | |
| a | 3913 | 0.3% | |
| M | 2626 | 0.2% | |
| E | 2626 | 0.2% | |
| X | 2626 | 0.2% | |
| C | 1870 | 0.1% | |
| A | 1870 | 0.1% | |
| N | 1870 | 0.1% | |
| P | 1202 | 0.1% | |
| R | 1202 | 0.1% |
| Distinct count | 4 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 624 |
| Missing (%) | 0.1% |
| Memory size | 4.9 MiB |
| US | |
|---|---|
| MEX | 2636 |
| CAN | 1874 |
| PR | 1203 |
| Value | Count | Frequency (%) | |
| US | 635577 | 99.0% | |
| MEX | 2636 | 0.4% | |
| CAN | 1874 | 0.3% | |
| PR | 1203 | 0.2% | |
| (Missing) | 624 | 0.1% |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 2.007997956 |
| Min length | 2 |
Most occurring characters
| Value | Count | Frequency (%) | |
| U | 635577 | 49.3% | |
| S | 635577 | 49.3% | |
| M | 2636 | 0.2% | |
| E | 2636 | 0.2% | |
| X | 2636 | 0.2% | |
| C | 1874 | 0.1% | |
| A | 1874 | 0.1% | |
| N | 1874 | 0.1% | |
| n | 1248 | 0.1% | |
| P | 1203 | 0.1% | |
| R | 1203 | 0.1% | |
| a | 624 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Uppercase Letter | 1287090 | 99.9% | |
| Lowercase Letter | 1872 | 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| U | 635577 | 49.4% | |
| S | 635577 | 49.4% | |
| M | 2636 | 0.2% | |
| E | 2636 | 0.2% | |
| X | 2636 | 0.2% | |
| C | 1874 | 0.1% | |
| A | 1874 | 0.1% | |
| N | 1874 | 0.1% | |
| P | 1203 | 0.1% | |
| R | 1203 | 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 1248 | 66.7% | |
| a | 624 | 33.3% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 1288962 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| U | 635577 | 49.3% | |
| S | 635577 | 49.3% | |
| M | 2636 | 0.2% | |
| E | 2636 | 0.2% | |
| X | 2636 | 0.2% | |
| C | 1874 | 0.1% | |
| A | 1874 | 0.1% | |
| N | 1874 | 0.1% | |
| n | 1248 | 0.1% | |
| P | 1203 | 0.1% | |
| R | 1203 | 0.1% | |
| a | 624 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1288962 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| U | 635577 | 49.3% | |
| S | 635577 | 49.3% | |
| M | 2636 | 0.2% | |
| E | 2636 | 0.2% | |
| X | 2636 | 0.2% | |
| C | 1874 | 0.1% | |
| A | 1874 | 0.1% | |
| N | 1874 | 0.1% | |
| n | 1248 | 0.1% | |
| P | 1203 | 0.1% | |
| R | 1203 | 0.1% | |
| a | 624 | < 0.1% |
posEntryMode
Categorical
| Distinct count | 5 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 3345 |
| Missing (%) | 0.5% |
| Memory size | 4.9 MiB |
| 05 | |
|---|---|
| 09 | |
| 02 | |
| 90 | 16251 |
| 80 | 12921 |
| Value | Count | Frequency (%) | |
| 05 | 255615 | 39.8% | |
| 09 | 193193 | 30.1% | |
| 02 | 160589 | 25.0% | |
| 90 | 16251 | 2.5% | |
| 80 | 12921 | 2.0% | |
| (Missing) | 3345 | 0.5% |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 2.005210978 |
| Min length | 2 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 0 | 638569 | 49.6% | |
| 5 | 255615 | 19.9% | |
| 9 | 209444 | 16.3% | |
| 2 | 160589 | 12.5% | |
| 8 | 12921 | 1.0% | |
| n | 6690 | 0.5% | |
| a | 3345 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 1277138 | 99.2% | |
| Lowercase Letter | 10035 | 0.8% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 638569 | 50.0% | |
| 5 | 255615 | 20.0% | |
| 9 | 209444 | 16.4% | |
| 2 | 160589 | 12.6% | |
| 8 | 12921 | 1.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 6690 | 66.7% | |
| a | 3345 | 33.3% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 1277138 | 99.2% | |
| Latin | 10035 | 0.8% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 0 | 638569 | 50.0% | |
| 5 | 255615 | 20.0% | |
| 9 | 209444 | 16.4% | |
| 2 | 160589 | 12.6% | |
| 8 | 12921 | 1.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 6690 | 66.7% | |
| a | 3345 | 33.3% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1287173 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 0 | 638569 | 49.6% | |
| 5 | 255615 | 19.9% | |
| 9 | 209444 | 16.3% | |
| 2 | 160589 | 12.5% | |
| 8 | 12921 | 1.0% | |
| n | 6690 | 0.5% | |
| a | 3345 | 0.3% |
posConditionCode
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 287 |
| Missing (%) | < 0.1% |
| Memory size | 4.9 MiB |
| 01 | |
|---|---|
| 08 | |
| 99 | 5976 |
| Value | Count | Frequency (%) | |
| 01 | 514144 | 80.1% | |
| 08 | 121507 | 18.9% | |
| 99 | 5976 | 0.9% | |
| (Missing) | 287 | < 0.1% |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 2.0004471 |
| Min length | 2 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 0 | 635651 | 49.5% | |
| 1 | 514144 | 40.0% | |
| 8 | 121507 | 9.5% | |
| 9 | 11952 | 0.9% | |
| n | 574 | < 0.1% | |
| a | 287 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 1283254 | 99.9% | |
| Lowercase Letter | 861 | 0.1% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 635651 | 49.5% | |
| 1 | 514144 | 40.1% | |
| 8 | 121507 | 9.5% | |
| 9 | 11952 | 0.9% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 574 | 66.7% | |
| a | 287 | 33.3% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 1283254 | 99.9% | |
| Latin | 861 | 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 0 | 635651 | 49.5% | |
| 1 | 514144 | 40.1% | |
| 8 | 121507 | 9.5% | |
| 9 | 11952 | 0.9% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 574 | 66.7% | |
| a | 287 | 33.3% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1284115 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 0 | 635651 | 49.5% | |
| 1 | 514144 | 40.0% | |
| 8 | 121507 | 9.5% | |
| 9 | 11952 | 0.9% | |
| n | 574 | < 0.1% | |
| a | 287 | < 0.1% |
merchantCategoryCode
Categorical
| Distinct count | 19 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.9 MiB |
| online_retail | |
|---|---|
| fastfood | |
| entertainment | |
| food | |
| rideshare | |
| Other values (14) |
| Value | Count | Frequency (%) | |
| online_retail | 161469 | 25.2% | |
| fastfood | 101196 | 15.8% | |
| entertainment | 69138 | 10.8% | |
| food | 68245 | 10.6% | |
| rideshare | 50574 | 7.9% | |
| online_gifts | 33045 | 5.1% | |
| hotels | 22879 | 3.6% | |
| fuel | 22566 | 3.5% | |
| subscriptions | 18376 | 2.9% | |
| personal care | 16917 | 2.6% | |
| mobileapps | 14614 | 2.3% | |
| health | 14344 | 2.2% | |
| online_subscriptions | 11247 | 1.8% | |
| auto | 10147 | 1.6% | |
| airline | 9990 | 1.6% | |
| furniture | 7813 | 1.2% | |
| food_delivery | 4990 | 0.8% | |
| gym | 2874 | 0.4% | |
| cable/phone | 1490 | 0.2% |
Length
| Max length | 20 |
|---|---|
| Median length | 10 |
| Mean length | 9.886609421 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| e | 814792 | 12.8% | |
| n | 684769 | 10.8% | |
| o | 650293 | 10.2% | |
| i | 626630 | 9.9% | |
| t | 587930 | 9.3% | |
| l | 475020 | 7.5% | |
| a | 466796 | 7.4% | |
| r | 425818 | 6.7% | |
| f | 339051 | 5.3% | |
| s | 328094 | 5.2% | |
| d | 229995 | 3.6% | |
| _ | 210751 | 3.3% | |
| h | 103631 | 1.6% | |
| m | 86626 | 1.4% | |
| u | 77962 | 1.2% | |
| p | 77258 | 1.2% | |
| c | 48030 | 0.8% | |
| b | 45727 | 0.7% | |
| g | 35919 | 0.6% | |
| 16917 | 0.3% | ||
| y | 7864 | 0.1% | |
| v | 4990 | 0.1% | |
| / | 1490 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 6117195 | 96.4% | |
| Connector Punctuation | 210751 | 3.3% | |
| Space Separator | 16917 | 0.3% | |
| Other Punctuation | 1490 | < 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 814792 | 13.3% | |
| n | 684769 | 11.2% | |
| o | 650293 | 10.6% | |
| i | 626630 | 10.2% | |
| t | 587930 | 9.6% | |
| l | 475020 | 7.8% | |
| a | 466796 | 7.6% | |
| r | 425818 | 7.0% | |
| f | 339051 | 5.5% | |
| s | 328094 | 5.4% | |
| d | 229995 | 3.8% | |
| h | 103631 | 1.7% | |
| m | 86626 | 1.4% | |
| u | 77962 | 1.3% | |
| p | 77258 | 1.3% | |
| c | 48030 | 0.8% | |
| b | 45727 | 0.7% | |
| g | 35919 | 0.6% | |
| y | 7864 | 0.1% | |
| v | 4990 | 0.1% |
Most frequent Connector Punctuation characters
| Value | Count | Frequency (%) | |
| _ | 210751 | 100.0% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 16917 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| / | 1490 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 6117195 | 96.4% | |
| Common | 229158 | 3.6% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 814792 | 13.3% | |
| n | 684769 | 11.2% | |
| o | 650293 | 10.6% | |
| i | 626630 | 10.2% | |
| t | 587930 | 9.6% | |
| l | 475020 | 7.8% | |
| a | 466796 | 7.6% | |
| r | 425818 | 7.0% | |
| f | 339051 | 5.5% | |
| s | 328094 | 5.4% | |
| d | 229995 | 3.8% | |
| h | 103631 | 1.7% | |
| m | 86626 | 1.4% | |
| u | 77962 | 1.3% | |
| p | 77258 | 1.3% | |
| c | 48030 | 0.8% | |
| b | 45727 | 0.7% | |
| g | 35919 | 0.6% | |
| y | 7864 | 0.1% | |
| v | 4990 | 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| _ | 210751 | 92.0% | |
| 16917 | 7.4% | ||
| / | 1490 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 6346353 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| e | 814792 | 12.8% | |
| n | 684769 | 10.8% | |
| o | 650293 | 10.2% | |
| i | 626630 | 9.9% | |
| t | 587930 | 9.3% | |
| l | 475020 | 7.5% | |
| a | 466796 | 7.4% | |
| r | 425818 | 6.7% | |
| f | 339051 | 5.3% | |
| s | 328094 | 5.2% | |
| d | 229995 | 3.6% | |
| _ | 210751 | 3.3% | |
| h | 103631 | 1.6% | |
| m | 86626 | 1.4% | |
| u | 77962 | 1.2% | |
| p | 77258 | 1.2% | |
| c | 48030 | 0.8% | |
| b | 45727 | 0.7% | |
| g | 35919 | 0.6% | |
| 16917 | 0.3% | ||
| y | 7864 | 0.1% | |
| v | 4990 | 0.1% | |
| / | 1490 | < 0.1% |
| Distinct count | 165 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.9 MiB |
| 05/2026 | 4209 |
|---|---|
| 10/2019 | 4201 |
| 08/2020 | 4188 |
| 05/2028 | 4186 |
| 01/2025 | 4154 |
| Other values (160) |
| Value | Count | Frequency (%) | |
| 05/2026 | 4209 | 0.7% | |
| 10/2019 | 4201 | 0.7% | |
| 08/2020 | 4188 | 0.7% | |
| 05/2028 | 4186 | 0.7% | |
| 01/2025 | 4154 | 0.6% | |
| 05/2024 | 4153 | 0.6% | |
| 08/2028 | 4119 | 0.6% | |
| 03/2024 | 4117 | 0.6% | |
| 08/2018 | 4116 | 0.6% | |
| 03/2019 | 4115 | 0.6% | |
| 07/2018 | 4109 | 0.6% | |
| 05/2025 | 4104 | 0.6% | |
| 10/2023 | 4103 | 0.6% | |
| 03/2028 | 4099 | 0.6% | |
| 08/2026 | 4097 | 0.6% | |
| 12/2021 | 4094 | 0.6% | |
| 08/2025 | 4091 | 0.6% | |
| 01/2023 | 4091 | 0.6% | |
| 01/2021 | 4090 | 0.6% | |
| 03/2023 | 4081 | 0.6% | |
| 05/2031 | 4074 | 0.6% | |
| 10/2031 | 4073 | 0.6% | |
| 05/2029 | 4068 | 0.6% | |
| 10/2024 | 4068 | 0.6% | |
| 03/2030 | 4066 | 0.6% | |
| Other values (140) | 539048 | 84.0% |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 0 | 1268502 | 28.2% | |
| 2 | 1263302 | 28.1% | |
| / | 641914 | 14.3% | |
| 1 | 443398 | 9.9% | |
| 3 | 196932 | 4.4% | |
| 9 | 146891 | 3.3% | |
| 8 | 131749 | 2.9% | |
| 7 | 101928 | 2.3% | |
| 5 | 100742 | 2.2% | |
| 6 | 100713 | 2.2% | |
| 4 | 97327 | 2.2% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 3851484 | 85.7% | |
| Other Punctuation | 641914 | 14.3% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 1268502 | 32.9% | |
| 2 | 1263302 | 32.8% | |
| 1 | 443398 | 11.5% | |
| 3 | 196932 | 5.1% | |
| 9 | 146891 | 3.8% | |
| 8 | 131749 | 3.4% | |
| 7 | 101928 | 2.6% | |
| 5 | 100742 | 2.6% | |
| 6 | 100713 | 2.6% | |
| 4 | 97327 | 2.5% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| / | 641914 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 4493398 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 0 | 1268502 | 28.2% | |
| 2 | 1263302 | 28.1% | |
| / | 641914 | 14.3% | |
| 1 | 443398 | 9.9% | |
| 3 | 196932 | 4.4% | |
| 9 | 146891 | 3.3% | |
| 8 | 131749 | 2.9% | |
| 7 | 101928 | 2.3% | |
| 5 | 100742 | 2.2% | |
| 6 | 100713 | 2.2% | |
| 4 | 97327 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 4493398 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 0 | 1268502 | 28.2% | |
| 2 | 1263302 | 28.1% | |
| / | 641914 | 14.3% | |
| 1 | 443398 | 9.9% | |
| 3 | 196932 | 4.4% | |
| 9 | 146891 | 3.3% | |
| 8 | 131749 | 2.9% | |
| 7 | 101928 | 2.3% | |
| 5 | 100742 | 2.2% | |
| 6 | 100713 | 2.2% | |
| 4 | 97327 | 2.2% |
| Distinct count | 1826 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.9 MiB |
| 2015-12-11 | 10137 |
|---|---|
| 2012-10-05 | 8382 |
| 2011-05-20 | 5494 |
| 2015-09-24 | 5478 |
| 2015-03-12 | 5398 |
| Other values (1821) |
| Value | Count | Frequency (%) | |
| 2015-12-11 | 10137 | 1.6% | |
| 2012-10-05 | 8382 | 1.3% | |
| 2011-05-20 | 5494 | 0.9% | |
| 2015-09-24 | 5478 | 0.9% | |
| 2015-03-12 | 5398 | 0.8% | |
| 2013-08-24 | 4836 | 0.8% | |
| 2014-01-31 | 4753 | 0.7% | |
| 2015-06-15 | 4612 | 0.7% | |
| 2013-07-04 | 3780 | 0.6% | |
| 2014-09-18 | 3311 | 0.5% | |
| 2015-01-10 | 3209 | 0.5% | |
| 2014-08-27 | 3162 | 0.5% | |
| 2014-01-11 | 3081 | 0.5% | |
| 2015-12-26 | 3051 | 0.5% | |
| 2013-09-09 | 3046 | 0.5% | |
| 2014-05-24 | 2983 | 0.5% | |
| 2014-06-06 | 2715 | 0.4% | |
| 2013-06-15 | 2687 | 0.4% | |
| 2014-05-25 | 2614 | 0.4% | |
| 2015-01-12 | 2584 | 0.4% | |
| 2012-10-30 | 2411 | 0.4% | |
| 2015-05-20 | 2407 | 0.4% | |
| 2015-06-26 | 2395 | 0.4% | |
| 2015-03-30 | 2285 | 0.4% | |
| 2015-03-16 | 2272 | 0.4% | |
| Other values (1801) | 544831 | 84.9% |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 0 | 1441001 | 22.4% | |
| - | 1283828 | 20.0% | |
| 1 | 1251454 | 19.5% | |
| 2 | 1088955 | 17.0% | |
| 5 | 390495 | 6.1% | |
| 4 | 255696 | 4.0% | |
| 3 | 237023 | 3.7% | |
| 6 | 121556 | 1.9% | |
| 9 | 120394 | 1.9% | |
| 8 | 120016 | 1.9% | |
| 7 | 108722 | 1.7% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 5135312 | 80.0% | |
| Dash Punctuation | 1283828 | 20.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 1441001 | 28.1% | |
| 1 | 1251454 | 24.4% | |
| 2 | 1088955 | 21.2% | |
| 5 | 390495 | 7.6% | |
| 4 | 255696 | 5.0% | |
| 3 | 237023 | 4.6% | |
| 6 | 121556 | 2.4% | |
| 9 | 120394 | 2.3% | |
| 8 | 120016 | 2.3% | |
| 7 | 108722 | 2.1% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 1283828 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 6419140 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 0 | 1441001 | 22.4% | |
| - | 1283828 | 20.0% | |
| 1 | 1251454 | 19.5% | |
| 2 | 1088955 | 17.0% | |
| 5 | 390495 | 6.1% | |
| 4 | 255696 | 4.0% | |
| 3 | 237023 | 3.7% | |
| 6 | 121556 | 1.9% | |
| 9 | 120394 | 1.9% | |
| 8 | 120016 | 1.9% | |
| 7 | 108722 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 6419140 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 0 | 1441001 | 22.4% | |
| - | 1283828 | 20.0% | |
| 1 | 1251454 | 19.5% | |
| 2 | 1088955 | 17.0% | |
| 5 | 390495 | 6.1% | |
| 4 | 255696 | 4.0% | |
| 3 | 237023 | 3.7% | |
| 6 | 121556 | 1.9% | |
| 9 | 120394 | 1.9% | |
| 8 | 120016 | 1.9% | |
| 7 | 108722 | 1.7% |
| Distinct count | 2186 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.9 MiB |
| 2016-07-20 | 3948 |
|---|---|
| 2016-03-15 | 3800 |
| 2016-01-26 | 3140 |
| 2016-01-29 | 3033 |
| 2016-04-25 | 2954 |
| Other values (2181) |
| Value | Count | Frequency (%) | |
| 2016-07-20 | 3948 | 0.6% | |
| 2016-03-15 | 3800 | 0.6% | |
| 2016-01-26 | 3140 | 0.5% | |
| 2016-01-29 | 3033 | 0.5% | |
| 2016-04-25 | 2954 | 0.5% | |
| 2016-07-22 | 2943 | 0.5% | |
| 2016-06-12 | 2810 | 0.4% | |
| 2016-06-06 | 2672 | 0.4% | |
| 2016-04-11 | 2638 | 0.4% | |
| 2016-01-20 | 2599 | 0.4% | |
| 2016-01-16 | 2400 | 0.4% | |
| 2016-07-18 | 2368 | 0.4% | |
| 2016-06-16 | 2337 | 0.4% | |
| 2016-08-04 | 2332 | 0.4% | |
| 2016-05-17 | 2326 | 0.4% | |
| 2016-08-01 | 2322 | 0.4% | |
| 2016-08-02 | 2231 | 0.3% | |
| 2016-03-16 | 2201 | 0.3% | |
| 2016-06-14 | 2170 | 0.3% | |
| 2016-05-25 | 2102 | 0.3% | |
| 2016-02-07 | 2094 | 0.3% | |
| 2016-03-26 | 2022 | 0.3% | |
| 2016-02-25 | 2021 | 0.3% | |
| 2016-05-11 | 2008 | 0.3% | |
| 2016-02-13 | 1960 | 0.3% | |
| Other values (2161) | 578483 | 90.1% |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 0 | 1465305 | 22.8% | |
| - | 1283828 | 20.0% | |
| 1 | 1183131 | 18.4% | |
| 2 | 1054724 | 16.4% | |
| 6 | 407039 | 6.3% | |
| 5 | 278608 | 4.3% | |
| 3 | 204439 | 3.2% | |
| 4 | 198616 | 3.1% | |
| 8 | 118059 | 1.8% | |
| 7 | 115894 | 1.8% | |
| 9 | 109497 | 1.7% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 5135312 | 80.0% | |
| Dash Punctuation | 1283828 | 20.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 1465305 | 28.5% | |
| 1 | 1183131 | 23.0% | |
| 2 | 1054724 | 20.5% | |
| 6 | 407039 | 7.9% | |
| 5 | 278608 | 5.4% | |
| 3 | 204439 | 4.0% | |
| 4 | 198616 | 3.9% | |
| 8 | 118059 | 2.3% | |
| 7 | 115894 | 2.3% | |
| 9 | 109497 | 2.1% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 1283828 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 6419140 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 0 | 1465305 | 22.8% | |
| - | 1283828 | 20.0% | |
| 1 | 1183131 | 18.4% | |
| 2 | 1054724 | 16.4% | |
| 6 | 407039 | 6.3% | |
| 5 | 278608 | 4.3% | |
| 3 | 204439 | 3.2% | |
| 4 | 198616 | 3.1% | |
| 8 | 118059 | 1.8% | |
| 7 | 115894 | 1.8% | |
| 9 | 109497 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 6419140 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 0 | 1465305 | 22.8% | |
| - | 1283828 | 20.0% | |
| 1 | 1183131 | 18.4% | |
| 2 | 1054724 | 16.4% | |
| 6 | 407039 | 6.3% | |
| 5 | 278608 | 4.3% | |
| 3 | 204439 | 3.2% | |
| 4 | 198616 | 3.1% | |
| 8 | 118059 | 1.8% | |
| 7 | 115894 | 1.8% | |
| 9 | 109497 | 1.7% |
| Distinct count | 899 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 557.1999270930373 |
|---|---|
| Minimum | 100 |
| Maximum | 998 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.9 MiB |
Quantile statistics
| Minimum | 100 |
|---|---|
| 5-th percentile | 148 |
| Q1 | 334 |
| median | 581 |
| Q3 | 762 |
| 95-th percentile | 954 |
| Maximum | 998 |
| Range | 898 |
| Interquartile range (IQR) | 428 |
Descriptive statistics
| Standard deviation | 257.3262041 |
|---|---|
| Coefficient of variation (CV) | 0.46182024 |
| Kurtosis | -1.147292612 |
| Mean | 557.1999271 |
| Median Absolute Deviation (MAD) | 214 |
| Skewness | -0.07758885003 |
| Sum | 357674434 |
| Variance | 66216.7753 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 633 | 11354 | 1.8% | |
| 746 | 8886 | 1.4% | |
| 625 | 7626 | 1.2% | |
| 312 | 6583 | 1.0% | |
| 986 | 6464 | 1.0% | |
| 676 | 5780 | 0.9% | |
| 731 | 4979 | 0.8% | |
| 180 | 4039 | 0.6% | |
| 465 | 3869 | 0.6% | |
| 654 | 3808 | 0.6% | |
| 324 | 3633 | 0.6% | |
| 815 | 3567 | 0.6% | |
| 713 | 3529 | 0.5% | |
| 467 | 3518 | 0.5% | |
| 383 | 3357 | 0.5% | |
| 148 | 2892 | 0.5% | |
| 910 | 2818 | 0.4% | |
| 874 | 2751 | 0.4% | |
| 921 | 2733 | 0.4% | |
| 135 | 2657 | 0.4% | |
| 166 | 2643 | 0.4% | |
| 439 | 2608 | 0.4% | |
| 576 | 2573 | 0.4% | |
| 126 | 2503 | 0.4% | |
| 951 | 2440 | 0.4% | |
| Other values (874) | 534304 | 83.2% |
| Value | Count | Frequency (%) | |
| 100 | 431 | 0.1% | |
| 101 | 116 | < 0.1% | |
| 102 | 343 | 0.1% | |
| 103 | 77 | < 0.1% | |
| 104 | 1770 | 0.3% | |
| 105 | 1085 | 0.2% | |
| 106 | 1119 | 0.2% | |
| 107 | 300 | < 0.1% | |
| 108 | 2193 | 0.3% | |
| 109 | 781 | 0.1% |
| Value | Count | Frequency (%) | |
| 998 | 209 | < 0.1% | |
| 997 | 378 | 0.1% | |
| 996 | 136 | < 0.1% | |
| 995 | 259 | < 0.1% | |
| 994 | 331 | 0.1% | |
| 993 | 1424 | 0.2% | |
| 992 | 570 | 0.1% | |
| 991 | 519 | 0.1% | |
| 990 | 278 | < 0.1% | |
| 989 | 699 | 0.1% |
| Distinct count | 980 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 556.775159912387 |
|---|---|
| Minimum | 1 |
| Maximum | 998 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 148 |
| Q1 | 333 |
| median | 580 |
| Q3 | 761 |
| 95-th percentile | 954 |
| Maximum | 998 |
| Range | 997 |
| Interquartile range (IQR) | 428 |
Descriptive statistics
| Standard deviation | 257.4026393 |
|---|---|
| Coefficient of variation (CV) | 0.4623098476 |
| Kurtosis | -1.146706164 |
| Mean | 556.7751599 |
| Median Absolute Deviation (MAD) | 214 |
| Skewness | -0.0771430696 |
| Sum | 357401770 |
| Variance | 66256.11874 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 633 | 11254 | 1.8% | |
| 746 | 8816 | 1.4% | |
| 625 | 7559 | 1.2% | |
| 312 | 6524 | 1.0% | |
| 986 | 6399 | 1.0% | |
| 676 | 5745 | 0.9% | |
| 731 | 4923 | 0.8% | |
| 180 | 4007 | 0.6% | |
| 465 | 3825 | 0.6% | |
| 654 | 3775 | 0.6% | |
| 324 | 3603 | 0.6% | |
| 815 | 3529 | 0.5% | |
| 467 | 3503 | 0.5% | |
| 713 | 3495 | 0.5% | |
| 383 | 3345 | 0.5% | |
| 148 | 2864 | 0.4% | |
| 910 | 2789 | 0.4% | |
| 874 | 2743 | 0.4% | |
| 921 | 2704 | 0.4% | |
| 135 | 2640 | 0.4% | |
| 166 | 2622 | 0.4% | |
| 439 | 2590 | 0.4% | |
| 576 | 2569 | 0.4% | |
| 126 | 2508 | 0.4% | |
| 951 | 2425 | 0.4% | |
| Other values (955) | 535158 | 83.4% |
| Value | Count | Frequency (%) | |
| 1 | 1 | < 0.1% | |
| 2 | 2 | < 0.1% | |
| 3 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% | |
| 5 | 2 | < 0.1% | |
| 6 | 2 | < 0.1% | |
| 7 | 5 | < 0.1% | |
| 8 | 2 | < 0.1% | |
| 9 | 4 | < 0.1% | |
| 10 | 4 | < 0.1% |
| Value | Count | Frequency (%) | |
| 998 | 208 | < 0.1% | |
| 997 | 378 | 0.1% | |
| 996 | 139 | < 0.1% | |
| 995 | 259 | < 0.1% | |
| 994 | 328 | 0.1% | |
| 993 | 1410 | 0.2% | |
| 992 | 569 | 0.1% | |
| 991 | 519 | 0.1% | |
| 990 | 279 | < 0.1% | |
| 989 | 692 | 0.1% |
| Distinct count | 5134 |
|---|---|
| Unique (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4886.18404334537 |
|---|---|
| Minimum | 0 |
| Maximum | 9998 |
| Zeros | 6727 |
| Zeros (%) | 1.0% |
| Memory size | 4.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 359 |
| Q1 | 2364 |
| median | 4873 |
| Q3 | 7267 |
| 95-th percentile | 9484 |
| Maximum | 9998 |
| Range | 9998 |
| Interquartile range (IQR) | 4903 |
Descriptive statistics
| Standard deviation | 2859.053679 |
|---|---|
| Coefficient of variation (CV) | 0.5851301657 |
| Kurtosis | -1.145169532 |
| Mean | 4886.184043 |
| Median Absolute Deviation (MAD) | 2435 |
| Skewness | 0.02565465023 |
| Sum | 3136509944 |
| Variance | 8174187.938 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 1789 | 10034 | 1.6% | |
| 5658 | 8412 | 1.3% | |
| 0 | 6727 | 1.0% | |
| 5335 | 5542 | 0.9% | |
| 4062 | 5146 | 0.8% | |
| 4690 | 4435 | 0.7% | |
| 7267 | 4227 | 0.7% | |
| 2705 | 3766 | 0.6% | |
| 2640 | 3414 | 0.5% | |
| 6060 | 3046 | 0.5% | |
| 1548 | 3021 | 0.5% | |
| 4737 | 2983 | 0.5% | |
| 4008 | 2615 | 0.4% | |
| 3165 | 2387 | 0.4% | |
| 8212 | 2379 | 0.4% | |
| 5358 | 2344 | 0.4% | |
| 1437 | 2326 | 0.4% | |
| 9716 | 2275 | 0.4% | |
| 3147 | 2084 | 0.3% | |
| 6310 | 1978 | 0.3% | |
| 4431 | 1956 | 0.3% | |
| 2242 | 1917 | 0.3% | |
| 6606 | 1875 | 0.3% | |
| 2330 | 1855 | 0.3% | |
| 6616 | 1754 | 0.3% | |
| Other values (5109) | 553416 | 86.2% |
| Value | Count | Frequency (%) | |
| 0 | 6727 | 1.0% | |
| 1 | 51 | < 0.1% | |
| 3 | 157 | < 0.1% | |
| 4 | 28 | < 0.1% | |
| 5 | 46 | < 0.1% | |
| 7 | 106 | < 0.1% | |
| 9 | 18 | < 0.1% | |
| 10 | 74 | < 0.1% | |
| 11 | 30 | < 0.1% | |
| 16 | 18 | < 0.1% |
| Value | Count | Frequency (%) | |
| 9998 | 53 | < 0.1% | |
| 9997 | 1 | < 0.1% | |
| 9995 | 152 | < 0.1% | |
| 9994 | 26 | < 0.1% | |
| 9990 | 95 | < 0.1% | |
| 9988 | 2 | < 0.1% | |
| 9985 | 100 | < 0.1% | |
| 9984 | 1431 | 0.2% | |
| 9983 | 9 | < 0.1% | |
| 9982 | 519 | 0.1% |
transactionType
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 589 |
| Missing (%) | 0.1% |
| Memory size | 4.9 MiB |
| PURCHASE | |
|---|---|
| ADDRESS_VERIFICATION | 16478 |
| REVERSAL | 16162 |
| Value | Count | Frequency (%) | |
| PURCHASE | 608685 | 94.8% | |
| ADDRESS_VERIFICATION | 16478 | 2.6% | |
| REVERSAL | 16162 | 2.5% | |
| (Missing) | 589 | 0.1% |
Length
| Max length | 20 |
|---|---|
| Median length | 8 |
| Mean length | 8.303453422 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| R | 673965 | 12.6% | |
| E | 673965 | 12.6% | |
| A | 657803 | 12.3% | |
| S | 657803 | 12.3% | |
| C | 625163 | 11.7% | |
| P | 608685 | 11.4% | |
| U | 608685 | 11.4% | |
| H | 608685 | 11.4% | |
| I | 49434 | 0.9% | |
| D | 32956 | 0.6% | |
| V | 32640 | 0.6% | |
| _ | 16478 | 0.3% | |
| F | 16478 | 0.3% | |
| T | 16478 | 0.3% | |
| O | 16478 | 0.3% | |
| N | 16478 | 0.3% | |
| L | 16162 | 0.3% | |
| n | 1178 | < 0.1% | |
| a | 589 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Uppercase Letter | 5311858 | 99.7% | |
| Connector Punctuation | 16478 | 0.3% | |
| Lowercase Letter | 1767 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| R | 673965 | 12.7% | |
| E | 673965 | 12.7% | |
| A | 657803 | 12.4% | |
| S | 657803 | 12.4% | |
| C | 625163 | 11.8% | |
| P | 608685 | 11.5% | |
| U | 608685 | 11.5% | |
| H | 608685 | 11.5% | |
| I | 49434 | 0.9% | |
| D | 32956 | 0.6% | |
| V | 32640 | 0.6% | |
| F | 16478 | 0.3% | |
| T | 16478 | 0.3% | |
| O | 16478 | 0.3% | |
| N | 16478 | 0.3% | |
| L | 16162 | 0.3% |
Most frequent Connector Punctuation characters
| Value | Count | Frequency (%) | |
| _ | 16478 | 100.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 1178 | 66.7% | |
| a | 589 | 33.3% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 5313625 | 99.7% | |
| Common | 16478 | 0.3% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| R | 673965 | 12.7% | |
| E | 673965 | 12.7% | |
| A | 657803 | 12.4% | |
| S | 657803 | 12.4% | |
| C | 625163 | 11.8% | |
| P | 608685 | 11.5% | |
| U | 608685 | 11.5% | |
| H | 608685 | 11.5% | |
| I | 49434 | 0.9% | |
| D | 32956 | 0.6% | |
| V | 32640 | 0.6% | |
| F | 16478 | 0.3% | |
| T | 16478 | 0.3% | |
| O | 16478 | 0.3% | |
| N | 16478 | 0.3% | |
| L | 16162 | 0.3% | |
| n | 1178 | < 0.1% | |
| a | 589 | < 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| _ | 16478 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 5330103 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| R | 673965 | 12.6% | |
| E | 673965 | 12.6% | |
| A | 657803 | 12.3% | |
| S | 657803 | 12.3% | |
| C | 625163 | 11.7% | |
| P | 608685 | 11.4% | |
| U | 608685 | 11.4% | |
| H | 608685 | 11.4% | |
| I | 49434 | 0.9% | |
| D | 32956 | 0.6% | |
| V | 32640 | 0.6% | |
| _ | 16478 | 0.3% | |
| F | 16478 | 0.3% | |
| T | 16478 | 0.3% | |
| O | 16478 | 0.3% | |
| N | 16478 | 0.3% | |
| L | 16162 | 0.3% | |
| n | 1178 | < 0.1% | |
| a | 589 | < 0.1% |
isFraud
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 627.0 KiB |
| False | |
|---|---|
| True | 11302 |
| Value | Count | Frequency (%) | |
| False | 630612 | 98.2% | |
| True | 11302 | 1.8% |
| Distinct count | 406990 |
|---|---|
| Unique (%) | 63.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4044.3820349922275 |
|---|---|
| Minimum | 0.0 |
| Maximum | 47496.5 |
| Zeros | 33622 |
| Zeros (%) | 5.2% |
| Memory size | 4.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 502.4425 |
| median | 2151.86 |
| Q3 | 5005.89 |
| 95-th percentile | 13782.554 |
| Maximum | 47496.5 |
| Range | 47496.5 |
| Interquartile range (IQR) | 4503.4475 |
Descriptive statistics
| Standard deviation | 5945.510224 |
|---|---|
| Coefficient of variation (CV) | 1.470066421 |
| Kurtosis | 17.3694379 |
| Mean | 4044.382035 |
| Median Absolute Deviation (MAD) | 1893.22 |
| Skewness | 3.600021658 |
| Sum | 2596145450 |
| Variance | 35349091.82 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 33622 | 5.2% | |
| 53.52 | 21 | < 0.1% | |
| 40.48 | 19 | < 0.1% | |
| 45.53 | 18 | < 0.1% | |
| 53.86 | 18 | < 0.1% | |
| 21.37 | 17 | < 0.1% | |
| 6.03 | 16 | < 0.1% | |
| 63.01 | 16 | < 0.1% | |
| 6.24 | 16 | < 0.1% | |
| 6.96 | 16 | < 0.1% | |
| 29.72 | 16 | < 0.1% | |
| 14.52 | 15 | < 0.1% | |
| 7.67 | 15 | < 0.1% | |
| 5.33 | 15 | < 0.1% | |
| 118.22 | 15 | < 0.1% | |
| 35.71 | 15 | < 0.1% | |
| 31.41 | 15 | < 0.1% | |
| 214.72 | 14 | < 0.1% | |
| 91.06 | 14 | < 0.1% | |
| 5.38 | 14 | < 0.1% | |
| 35.06 | 14 | < 0.1% | |
| 16.12 | 14 | < 0.1% | |
| 7.23 | 14 | < 0.1% | |
| 6.77 | 14 | < 0.1% | |
| 27.05 | 14 | < 0.1% | |
| Other values (406965) | 607917 | 94.7% |
| Value | Count | Frequency (%) | |
| 0 | 33622 | 5.2% | |
| 0.01 | 1 | < 0.1% | |
| 0.03 | 1 | < 0.1% | |
| 0.04 | 2 | < 0.1% | |
| 0.05 | 4 | < 0.1% | |
| 0.06 | 4 | < 0.1% | |
| 0.07 | 1 | < 0.1% | |
| 0.08 | 6 | < 0.1% | |
| 0.09 | 3 | < 0.1% | |
| 0.11 | 4 | < 0.1% |
| Value | Count | Frequency (%) | |
| 47496.5 | 1 | < 0.1% | |
| 47496.34 | 1 | < 0.1% | |
| 47494.26 | 1 | < 0.1% | |
| 47491.52 | 1 | < 0.1% | |
| 47490.97 | 1 | < 0.1% | |
| 47490.73 | 1 | < 0.1% | |
| 47490.66 | 1 | < 0.1% | |
| 47483.99 | 1 | < 0.1% | |
| 47481.13 | 1 | < 0.1% | |
| 47480.98 | 1 | < 0.1% |
cardPresent
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 627.0 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) | |
| False | 340453 | 53.0% | |
| True | 301461 | 47.0% |
expirationDateKeyInMatch
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 627.0 KiB |
| False | |
|---|---|
| True | 969 |
| Value | Count | Frequency (%) | |
| False | 640945 | 99.8% | |
| True | 969 | 0.2% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| accountNumber | customerId | creditLimit | availableMoney | transactionDateTime | transactionAmount | merchantName | acqCountry | merchantCountryCode | posEntryMode | posConditionCode | merchantCategoryCode | currentExpDate | accountOpenDate | dateOfLastAddressChange | cardCVV | enteredCVV | cardLast4Digits | transactionType | isFraud | currentBalance | cardPresent | expirationDateKeyInMatch | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 733493772 | 733493772 | 5000 | 5000.00 | 2016-01-08T19:04:50 | 111.33 | Lyft | US | US | 05 | 01 | rideshare | 04/2020 | 2014-08-03 | 2014-08-03 | 492 | 492 | 9184 | PURCHASE | True | 0.00 | False | False |
| 1 | 733493772 | 733493772 | 5000 | 4888.67 | 2016-01-09T22:32:39 | 24.75 | Uber | US | US | 09 | 01 | rideshare | 06/2023 | 2014-08-03 | 2014-08-03 | 492 | 492 | 9184 | PURCHASE | False | 111.33 | False | False |
| 2 | 733493772 | 733493772 | 5000 | 4863.92 | 2016-01-11T13:36:55 | 187.40 | Lyft | US | US | 05 | 01 | rideshare | 12/2027 | 2014-08-03 | 2014-08-03 | 492 | 492 | 9184 | PURCHASE | False | 136.08 | False | False |
| 3 | 733493772 | 733493772 | 5000 | 4676.52 | 2016-01-11T22:47:46 | 227.34 | Lyft | US | US | 02 | 01 | rideshare | 09/2029 | 2014-08-03 | 2014-08-03 | 492 | 492 | 9184 | PURCHASE | True | 323.48 | False | False |
| 4 | 733493772 | 733493772 | 5000 | 4449.18 | 2016-01-16T01:41:11 | 0.00 | Lyft | US | US | 02 | 01 | rideshare | 10/2024 | 2014-08-03 | 2014-08-03 | 492 | 492 | 9184 | ADDRESS_VERIFICATION | False | 550.82 | False | False |
| 5 | 733493772 | 733493772 | 5000 | 4449.18 | 2016-01-16T21:35:27 | 9.80 | Fresh eCards | US | US | 05 | 01 | online_gifts | 02/2021 | 2014-08-03 | 2014-08-03 | 492 | 492 | 9184 | PURCHASE | False | 550.82 | False | False |
| 6 | 733493772 | 733493772 | 5000 | 4439.38 | 2016-01-24T07:54:01 | 247.99 | Uber | NaN | US | 05 | 01 | rideshare | 01/2026 | 2014-08-03 | 2014-08-03 | 492 | 492 | 9184 | PURCHASE | False | 560.62 | False | False |
| 7 | 733493772 | 733493772 | 5000 | 4191.39 | 2016-01-26T05:28:24 | 0.00 | Universe Massage #95463 | US | US | 05 | 01 | personal care | 12/2031 | 2014-08-03 | 2014-08-03 | 492 | 492 | 9184 | ADDRESS_VERIFICATION | False | 808.61 | False | False |
| 8 | 733493772 | 733493772 | 5000 | 4191.39 | 2016-01-26T12:18:14 | 11.54 | Universe Massage #70014 | US | US | 05 | 01 | personal care | 04/2024 | 2014-08-03 | 2014-08-03 | 492 | 492 | 9184 | PURCHASE | False | 808.61 | True | False |
| 9 | 733493772 | 733493772 | 5000 | 4179.85 | 2016-01-26T12:19:15 | 11.54 | Universe Massage #70014 | US | US | 05 | 01 | personal care | 04/2024 | 2014-08-03 | 2014-08-03 | 492 | 492 | 9184 | REVERSAL | False | 820.15 | True | False |
Last rows
| accountNumber | customerId | creditLimit | availableMoney | transactionDateTime | transactionAmount | merchantName | acqCountry | merchantCountryCode | posEntryMode | posConditionCode | merchantCategoryCode | currentExpDate | accountOpenDate | dateOfLastAddressChange | cardCVV | enteredCVV | cardLast4Digits | transactionType | isFraud | currentBalance | cardPresent | expirationDateKeyInMatch | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 641904 | 186770399 | 186770399 | 7500 | 3619.56 | 2016-11-04T01:33:34 | 5.37 | Apple iTunes | US | US | 05 | 08 | mobileapps | 01/2030 | 2015-11-04 | 2016-06-03 | 127 | 127 | 5432 | PURCHASE | False | 3880.44 | False | False |
| 641905 | 186770399 | 186770399 | 7500 | 3614.19 | 2016-11-07T20:48:59 | 147.97 | Blue Mountain Online Services | US | US | 02 | 01 | online_gifts | 05/2030 | 2015-11-04 | 2016-06-03 | 127 | 127 | 5432 | PURCHASE | False | 3885.81 | False | False |
| 641906 | 186770399 | 186770399 | 7500 | 3466.22 | 2016-11-12T11:02:33 | 883.79 | Fresh Online Services | US | US | 09 | 01 | online_gifts | 11/2029 | 2015-11-04 | 2016-06-03 | 127 | 127 | 5432 | PURCHASE | False | 4033.78 | False | False |
| 641907 | 186770399 | 186770399 | 7500 | 2582.43 | 2016-11-17T06:45:58 | 16.31 | abc.com | US | US | 09 | 08 | online_subscriptions | 11/2029 | 2015-11-04 | 2016-06-03 | 127 | 127 | 5432 | PURCHASE | False | 4917.57 | False | False |
| 641908 | 186770399 | 186770399 | 7500 | 2566.12 | 2016-11-18T19:50:45 | 17.10 | Next Day Online Services | US | US | 05 | 01 | online_gifts | 05/2025 | 2015-11-04 | 2016-06-03 | 127 | 127 | 5432 | PURCHASE | False | 4933.88 | False | False |
| 641909 | 186770399 | 186770399 | 7500 | 2574.02 | 2016-12-04T12:29:21 | 5.37 | Apple iTunes | US | US | 05 | 08 | mobileapps | 01/2030 | 2015-11-04 | 2016-06-03 | 127 | 127 | 5432 | PURCHASE | False | 4925.98 | False | False |
| 641910 | 186770399 | 186770399 | 7500 | 2568.65 | 2016-12-09T04:20:35 | 223.70 | Blue Mountain eCards | US | US | 09 | 01 | online_gifts | 05/2026 | 2015-11-04 | 2016-06-03 | 127 | 127 | 5432 | PURCHASE | False | 4931.35 | False | False |
| 641911 | 186770399 | 186770399 | 7500 | 2344.95 | 2016-12-16T07:58:23 | 138.42 | Fresh Flowers | US | US | 02 | 01 | online_gifts | 10/2019 | 2015-11-04 | 2016-06-03 | 127 | 127 | 5432 | PURCHASE | False | 5155.05 | False | False |
| 641912 | 186770399 | 186770399 | 7500 | 2206.53 | 2016-12-19T02:30:35 | 16.31 | abc.com | US | US | 09 | 08 | online_subscriptions | 11/2029 | 2015-11-04 | 2016-06-03 | 127 | 127 | 5432 | PURCHASE | False | 5293.47 | False | False |
| 641913 | 186770399 | 186770399 | 7500 | 2190.22 | 2016-12-28T11:14:14 | 32.53 | Next Day Online Services | US | US | 09 | 01 | online_gifts | 08/2025 | 2015-11-04 | 2016-06-03 | 127 | 127 | 5432 | PURCHASE | False | 5309.78 | False | False |